Summary of Text Categorization based on Maximum Entropy Model
نویسندگان
چکیده
Since 1990s, the maximum entropy model has been used in text categorization and achieves good results in Natural Language Processing since its framework and algorithm were established. On the basis of the Maximum Entropy Model, scholars improve it and make a more in-depth study. Using Maximum Entropy Model for text sentiment categorization has become a hot research topic in recent years. In this paper, the application of Maximum Entropy Model in text categorization is analyzed and classified into three categories: text categorization based on the original Maximum Entropy Model, text categorization based on improved Maximum Entropy Model and text emotion categorization based on Maximum Entropy Model. The authors consider that the existing text categorization using Maximum Entropy Model is to classify the text into a certain category directly, but not to give the probability that a text belongs to a category based on characteristic of Maximum Entropy Model. Therefore, the future research focus will be on the use of the maximum entropy model in t fuzzy classification information.
منابع مشابه
A Survey Paper On Naive Bayes Classifier For Multi-Feature Based Text Mining
Text mining is variance of a field called data mining. To make unstructured data workable by the computer Text mining is used which is also referred as “Text Analytics”. Text categorization, also called as topic spotting is the task of automatically classifies a set of documents into groups from a predefined set. Text classification is an essential application and research topic because of incr...
متن کاملA Summary Writing Model Based on Van Dijk’s Concept of Macrostructure and its Application within the Genre-Based Approach
This study was an attempt to provide a comprehensive model for summary writing based on the model of Van Dijk’s concept of macrostructures. The effectiveness of the model was examined in a genre-based quasi-experimental study with the data collection procedure lasting a semester. The participants included 60 female English learners divided into two experimental and control groups. The results o...
متن کاملEvaluation and Extension of Maximum Entropy Models with Inequality Constraints
A maximum entropy (ME) model is usually estimated so that it conforms to equality constraints on feature expectations. However, the equality constraint is inappropriate for sparse and therefore unreliable features. This study explores an ME model with box-type inequality constraints, where the equality can be violated to reflect this unreliability. We evaluate the inequality ME model using text...
متن کاملThe Impact of Summary Writing with Structure Guidelines on EFL College Students’ Rhetorical Organization: Integrating Genre-Based and Process Approaches
This study aimed at investigating the impact of writing on Iranian EFL college students’ rhetorical organization. Thirty Iranian female undergraduate students majoring in English at Al-zahra University participated in the current study. The writing instructions included two stages, each lasting for four weeks. The participants were assigned to a control group and an experimental group according...
متن کاملImproving the Operation of Text Categorization Systems with Selecting Proper Features Based on PSO-LA
With the explosive growth in amount of information, it is highly required to utilize tools and methods in order to search, filter and manage resources. One of the major problems in text classification relates to the high dimensional feature spaces. Therefore, the main goal of text classification is to reduce the dimensionality of features space. There are many feature selection methods. However...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017